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Abstract 

Motivated by a concrete problem and with the goal of understanding the sense in which the complex- 
ity of streaming algorithms is related to the complexity of formal languages, we investigate the problem 
Dyck(s) of checking matching parentheses, with s different types of parenthesis. 

We present a one-pass randomized streaming algorithm for Dyck(2) with space 0(^/nlogn), time 
per letter polylog(n), and one-sided error. We prove that this one-pass algorithm is optimal, up to a 
polylog n factor, even when two-sided error is allowed, and conjecture that a similar bound holds for 
any constant number of passes over the input. 

Surprisingly, the space requirement shrinks drastically if we have access to the input stream in re- 
verse. We present a two-pass randomized streaming algorithm for Dyck(2) with space 0((logn) 2 ), 
time poly log (n) and one-sided error, where the second pass is in the reverse direction. Both algorithms 
can be extended to Dyck(s) since this problem is reducible to Dyck(2) for a suitable notion of reduc- 
tion in the streaming model. Except for an extra 0(v / log s ) multiplicative overhead in the space required 
in the one-pass algorithm, the resource requirements are of the same order. 

For the lower bound, we exhibit hard instances AsCENSlON(m) of Dyck(2) with length 6 (ran). 
We embed these in what we call a "one-pass" communication problem with 2m-players, where m = 
O(n). To establish the hardness of AsCENSlON(m), we prove a direct sum result by following the 
"information cost" approach, but with a few twists. Indeed, we play a subtle game between public and 
private coins for MOUNTAIN, which corresponds to a primitive instance Ascension(I). This mixture 
between public and private coins for MOUNTAIN results from a balancing act between the direct sum 
result and a combinatorial lower bound for MOUNTAIN. 
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1 Introduction 



The area of streaming algorithms has experienced tremendous growth over the last decade in many appli- 
cations. Streaming algorithms sequentially scan the whole input piece by piece in one pass, or in a small 
number of passes (i.e., they do not have random access to the input), while using sublinear memory space, 
ideally polylogarithmic in the size of the input. The design of streaming algorithms is motivated by the ex- 
plosion in the size of the data that algorithms are called upon to process in everyday real-time applications, 
for example in bioinformatics for genome decoding, in Web databases for the search of documents, or in 
network monitoring. The analysis of Internet traffic 0, in which traffic logs are queried, was one of the first 
applications of this kind of algorithm. Few applications have been made in the context of formal languages, 
which may have impact on massive data such as DNA sequences and large XML files. For instance, in the 
context of databases, properties decidable by streaming algorithm have been studied ll24l l23l . but only in 
the restricted case of deterministic and constant memory space algorithms. 

Motivated by a concrete problem and with the goal of understanding the sense in which the complex- 
ity of streaming algorithms is related to the complexity of formal languages, we investigate the problem 
Dyck(s) of checking matching parentheses, with s different types of parenthesis. Regular languages are 
by nature decidable by deterministic streaming algorithms with constant space. The Dyck languages are 
some of the simplest context-free languages and yet already powerful. The Dyck(s) language plays a 
central role in context-free languages, since every context-free language L can be mapped to a subset of 
Dyck(s) 19]. In addition to its theoretical importance, the problem of checking matching parentheses is 
enountered frequently in database applications, for instance in verifying that an XML file is well-formed. 

Deciding membership in Dyck(s) has already been addressed in the massive data setting, more pre- 
cisely through property testing algorithms. An e-property tester ||6l |7J [T2] for a language L accepts all 
strings of L and rejects all strings which are e-far from strings in L, for the normalized Hamming distance. 
For every fixed e > 0, Dyck(1) is e-testable in constant time Q], whereas in general Dyck(s) are e-testable 
in time 0(n 2 / 3 ), with a lower bound of ^(n 1 / 11 ) ll2"Tll . In ifTTj . a comparison between property testers and 
streaming algorithms has been made. One advantage of streaming algorithms is that they have access to the 
full string, albeit not in a random access fashion. 

With random access to the input, context-free languages are known to be recognizable in 
space 0((logn) 2 ) |T3l . In the special case of Dyck(s), logarithmic space is sufficient, as we may run 
through all possible heights, and check parentheses at the same height. This scheme does not seem to easily 
translate to streaming algorithms, even with a small number of passes over the input. 

In the streaming model, Dyck(1) has a one-pass streaming algorithm with logarithmic space, using a 
height counter. Using a one-way communication complexity argument for EQUALITY, we can show that 
Dyck(2) requires linear space for deterministic one-pass streaming algorithms. A relaxation of Dyck(s) 
is Identity(s) in the free group with s generators, where local simplifications aa = e are allowed in 
addition to aa = e, for every type of parenthesis (a, a). There is a logarithmic space algorithm for recogniz- 
ing the language Identity(s) [18] that can easily be massaged into a one-pass streaming algorithm with 
polylogarithmic space. Again, this algorithm does not extend to Dyck(s). 

We show that Dyck(s) is reducible to Dyck(2), for a suitable notion of reduction in the streaming 
model, with a log s factor expansion in the input length. Our first algorithm is a one-pass randomized 
streaming algorithm for Dyck(2) with space 0(y/nlogn) and time polylog(n) (Theorem]!]). If we had 
no space constraints the algorithm would be very simple: when we encounter an upstep (a or b), push it 
on a stack, when we encounter a downstep (a or b), pop the top item from the stack and check that they 
match. However the stack may grow to linear size in this process. To avoid this growth, the basic principle 
of our algorithm is that instead, we use a linear hash function to periodically (every ^fn letters) compress the 
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information. As long as we compress only upsteps or only downsteps, all at different heights, we are able to 
detect mismatches with high probability. The algorithm has one-sided error; it accepts words that belong to 
the language with certainty. Although simple, we show that this appealing algorithm is nearly optimal in its 
space usage, even when two-sided error is allowed (Corollary [l]). 

We conjecture that our lower bound still holds if we read the stream several times, but always in the 
same direction. Surprisingly, the situation is drastically different if we can read the data stream in reverse. 
We present a second algorithm, a randomized two-pass streaming algorithm for Dyck(2) with 0((log n) 2 ) 
space and time polylog(n), where the second pass is in the reverse direction (Theorem|2]). This algorithm 
uses a hierarchical decomposition of the stream into blocks; whenever the algorithm reaches the end of 
a block, it compresses the information about subwords from within the block. This compression is what 
reduces the stack size from yjn down to O(logra), but prevents us from checking that certain pairs of 
parentheses are well-formed. However, given the profile of the word (i.e., the sequence of heights), we can 
pinpoint exactly the pairs that do not get checked. As it turns out, a pair that does not get checked when 
scanning the input left to right will necessarily be checked when scanning in the reverse direction. Like the 
one-pass algorithm, this algorithm has only one-sided error, and always accepts words that belong to the 
language. We note that it is easy to extend the algorithms so that it recognizes the language of substrings 
(which are subsequences of consecutive letters) of Dyck(2). 

As mentioned above, we also investigate the lower bound on the space required for any one-pass ran- 
domized streaming algorithms. Such a lower bound is by nature hard to prove because of the connection 
of the problem with Identity(2). Moreover, proving any lower bound based on two-party communication 
complexity is hopeless: the related communication problem automatically reduces to EQUALITY after local 
checks and simplifications by both players, thereby proving only an S7(log n) lower bound. Instead, we build 
hard instances AsCENSlON(m) of Dyck(2) with length 0(mn), that we embed in a "one-pass" commu- 
nication problem with 2m players, where m = 0(n). The constraint is that the length of each message in 
the protocol be less than size, a function of n. Our main result (Theorem [4]) is that such a protocol requires 
size = J7(n), which proves that our one-pass algorithm is nearly optimal (Corollary [I]). 

To establish the hardness of AsCENSlON(m), we prove a direct sum result that captures its relationship 
to solving m instances of the intermediate problem MOUNTAIN, which involves only two players. We 
follow the "information cost" approach taken in (8l|22l|4j[T7l[l5l, among other works before and since. We 
adapt this notion to suit both the nature of streaming algorithms and of our problem. The idea is to focus on 
the information about a part of the input contained in a part of the protocol transcript, given the remaining 
inputs. 

Using this notion of information cost, we prove a direct sum result (Lemma [3]). A remarkable device 
here is the use of an "easy" distribution for the information cost for protocols, that are correct with high 
probability in the worst case. The use of an easy distribution "collapses" AsCENSlON(m) to an instance 
of MOUNTAIN, which may be planted in any one of the m coordinates. This technique was developed 
in |4J, but comes with a few twists in our case. Indeed, we play a subtle game between public and private 
coins. Namely, in protocols for AsCENSlON(m) only public coins are allowed for all players, whereas for 
MOUNTAIN one of the players, Bob, can also access private coins, while Alice, the other player, cannot. 
This mixture between public and private coins for MOUNTAIN arises from a balancing act between the 
direct sum result and our combinatorial lower bound for MOUNTAIN (Theorem [3]). Namely, we are only 
able to prove the lower bound for MOUNTAIN when Alice only uses public coins, whereas the direct sum 
only holds, with our definition of information cost, when Bob has access to additional private coins. 

We note that as a bonus, our lower bound provides a ^l(^/n) lower bound for the problem of checking 
priority queues in the one-pass streaming model, solving an open problem of [10]. 
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2 Definitions and preliminaries 

Here is a formal definition of the language of parentheses with s types of parenthesis. 

Definition 1 (Dyck). Let s > 1 be an integer and let £ = {ax,a\, . . . ,a s ,a s }. Let Dyck(s) be the 
language over £ defined recursively by Dyck(s) = e + aiDYCK(s)ai + . . . + a s DYCK(s)a s . 

We also denote by Dyck(s) the problem of deciding if w is in the language Dyck(s), given the word 

w G £*. 

We recall the notion of streaming algorithms, where one pass on an input iEE" means that x is given 
as an input stream x\ , x^ , . . . , x n , which arrives sequentially, i.e., letter by letter in this order. For simplicity, 
we assume throughout this article that the length n of the input is always given to the algorithm in advance. 
Nonetheless, all our algorithms can be adapted to the general case where n is unknown until the end of a 
pass. For an excellent introduction to streaming algorithms, we refer the reader to the book lfl9l . 

Definition 2 (Streaming algorithm). Fix an alphabet S. A k-pass streaming algorithm A with space s(n) 
and time t{n) is an algorithm such that for every input stream x G S n : ( 1 ) A performs k sequential passes 
on x; (2) A maintains a memory space of size s(n) while reading x; (3) A has running time at most t{n) per 
letter xi; (4) A has preprocessing and postprocessing time t(n). 

We say that A is bidirectional if it is allowed to access to the input in the reverse order, after reaching the 
end of the input. Then the parameter k is the total number of passes in either direction. 

Definition 3 (Streaming reduction). Fix two alphabets Si and £2- A problem Pi is f(n)-streaming 
reducible to a problem P2 with space s(n) and time t{n), if for every input x G there exists 

V = yW2 ■ ■ - Vn> with yi G S^ n \ such that: (1) yi can be computed deterministically from xi using space 
s{n) and time t(n); (2) From a solution of P2 with input y, a solution on P± with input x can be computed 
with space s(n) and time t(n). 

Fact 1. Let Pi be f(n) -streaming reducible to a problem P2 with space so(n) and time to(n). Let A be 
a k-pass streaming algorithm for P2 with space s(n) and time t(n). Then there is a k-pass streaming 
algorithm for Pi with space s{n x /(n)) + So( n ) an d time t{n x f{n)) + to(n) with the same properties as 
A ( deterministic/randomized, unidirectional/bidirectional). 

Proposition 1. Dyck(s) is [log s\-streaming reducible to Dyck(2) with space and time 0(log s). 

Proof. We encode a parenthesis by a word of length I = [log s] with only parenthesis eti, 0,2 as follows. 
We let f(di) be the binary expansion of i over I bits where is replaced by ai and 1 by a2- Then /(a,) is 
defined similarly, except that we write the binary expansion of i in the opposite order. Then x% . . . x n is in 
Dyck(s) if and only f(xi) . . . f(x n ) is in Dyck(2). □ 

Note that in most often cases, such as XML files, the above reduction can be implemented within con- 
stant space an time. Indeed, given an upstate (start-tag) <w> (resp. an downstep (end-tag) </w>), where w 
is an ASCII string denoting the type of the parenthesis (tag), we can generate the above encoding of w into 
01, ei2 (respectively, into a\, 0,2), while reading w as a stream itself, i.e., character by character. 

By Proposition [TJ it is enough to design streaming algorithms for Dyck(2). That is the objective of the 
next section. 
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3 Algorithms 



Throughout this section we consider Dyck(2) where the input is a stream of n letters X1X2 ■ ■ ■ x n in the 
alphabet £ = {a, a, b, b}. We first introduce a few definitions. 

Definition 4 (Height, Matching pair, Well- formed). Let x G £ n . 

An upstep ( respectively, downstepj is a letter a or ab ( respectively, a or b). 

The height of x is height(x) = \x\ a + — \x\a — |x|r. 

For 1 < i < j < n, the pair is a matching pair for x i/height(a;[l,i]) = height(x[l,j — 1]) and 
height (x[l, k]) > height(x[l, i])for all k G {i + 1, . . . ,j -2}. 
The height of a matching pair (i, j) is height(x[l, i]). 

A matching pair (i,j)forx is well-formed, if(x[i], x[j]) equals (a, a) or (b,b). 

These definitions are extended to subsets / C [l,n] of indices of letters of x. For instance, we say 
that / is a matching set for x, if I = U{i,j}, where the union is over a subset of the matching pairs 
for x. For ease of notation, we identify an increasing sequence i\ < %2 < ■ ■ ■ < i m of indices with the 
corresponding sub-word x^, Xi 2 , . . . , xi m of x. We also use this correspondence in reverse when the indices 
of the sub- word are clear from the context. Last we need a weaker condition than well-formedness: 

Definition 5. Let x G S ra , / C [n] and I G {a, b}. Then I is /-balanced if for all d > 0, 

\{i £ I : Xi = Z,height(x[l,i]) = d}\ = \{i e I : Xi = I, height(x[l, i - 1]) = d}\ . 

Moreover I is balanced if it is both a-balanced and b-balanced. 

We now give a well-known characterization of Dyck(2). 

Fact 2. Let x G S n . Then x G Dyck(2) if and only if: the height of every prefix of x is nonnegative, 
height (x) = 0, and [1, n] is a well-formed set for x. 

During the computation the algorithm implicitly keeps track of the height of the word read so far. Let 
p be a prime number such that n 1+c < p < 2n 1+c , for some fixed constant c > 1. We assume that the 
algorithm has access to a random function hash mapping subsequences of x to integers in [0,p — 1], as 
follows: hash(xj 1 Xj 2 . . . Xj m ) = ^ - hash(xj j ), with 



where a is a uniformly random integer in [0, p — 1] . 

The value of hash(x) is a polynomial in a of degree bounded by the maximum height of a prefix, which 
is at most n. Therefore if h is not identically zero, by Schwartz's lemma, for a random a the probability 
that hash(x) = is at most n/p < rT c . Therefore we get the following characterization of balanced 
subsequences: 

Fact 3. Let x G S n , and let v = x^Xi 2 . . . be a subsequence of letters of x. If v is a balanced set for x, then 
hash(w) = Ofor all a; otherwise hash(u) = with probability at most n~ c , for a uniformly random a. 

For any letter Vi, we may compute hash(t;j) in time polylogn and space O(logre). Moreover, for any 
word v the value of hash(v) may be maintained with 0(log n) space. 




aght(xii,.j) mod p if xi = a, 

hei g ht(x[i,i-i}) mod p if Xi = a, 



otherwise 
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3.1 The one-pass algorithm 

The algorithm is easiest to understand if x = x^x^ d \ where x^ u > has only upsteps and a;W has only 
downsteps, in equal numbers. To check x( u )x( d ) g Dyck(2), the naive algorithm would grow a stack of size 
n/2. Here is a simple alternative. We read the input in blocks of length ^fn. While our algorithm is reading 
letters of p"), the stack stores the values of hash(x[i\/n + 1, (i + 1)-^/^-]) for each i G {1, . . . , y / n/2} and 
notes that height (x[iy / n + 1, (i+l)y/n\) = y/n. While the algorithm is reading x^ d \ it adds h.ash.(x\j y/n+ 
1, (j + l)yfn\) to hash(x[iy / n + 1, (i + l)^/n\) for j = y/n — i — 1, and checks if their sum is 0. The input x 
is ill-formed if any of the sums is non-zero. Our algorithm is a generalization of this stack compression idea. 

For any downstep Xj, our algorithm, given (h,£) = (hash(w), height (v)), can easily compute 
hash(vxj) = h + hash(xj) and height (vxj) =1—1, without explicit knowledge of v. Note that this 
relies on the linearity of our hash function. 

Algorithm [l] reads the stream in blocks of *Jn letters. It uses a stack data structure encoding the prefix 
formed by all the letters seen so far but whose matching pairs have not yet been checked. 

Algorithm 1 One -pass algorithm 
S <— empty stack 
for i <— 1 to yfn do 

Algorithm [2] with S {which consumes ^Jn letters} 
end for 

if S not empty, reject: "missing closing parenthesis" 
return accept 



For clarity of exposition, we describe an "off-line" version of Algorithm [2] that processes the letters in 
a block of size ^fn after the entire block has been read. With little additional effort, it can be converted to 
an "online" algorithm that takes polylog n time per letter. 

Within a block, Algorithm [2] reads the letters one by one, doing the obvious checks (with the straight- 
forward algorithm that uses a linear-size stack). After simplifying the pairs that have been checked, the 
block is reduced to a word in (a + b)*(a + b)*, consisting of a sequence w' of only downsteps followed by a 
sequence w" of only upsteps. To retain needed information about the blocks that have already been scanned, 
the algorithm uses a stack data structure. Each stack item is a pair of the form (h, £) encoding a subsequence 
v of letters of the stream x such that h = hash(z;) and £ = height (u ). Recall that the computation of 
hash(w) depends on the height of its starting point within x. 

As the algorithm processes the letters in w', it incorporates information about them into the last stack 
item, until the associated subsequence has height 0. At that point, to test whether v is well-formed, the 
algorithm checks whether hash(w) = 0. If this test succeeds, the entry of the stack encoding v can now be 
removed. The subword w" is processed in a straightforward manner by creating a new stack item associated 
with w". An example execution of the algorithm is shown in Appendix [A] 
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Algorithm 2 One -pass subroutine: reading one block 
{uses an input stack S} 

read the word w consisting of the next y/n letters (or less if the stream becomes empty) 
check that matching pairs within w are well-formed (if not, reject: "mismatched parentheses") 
simplify w into w'w", where w' has only downsteps and w" has only upsteps 
for i «— 1 to \w'\ do 

pop (h, £) from S (if empty, reject: "extra closing parenthesis") 
{(h,£) encodes some subsequence v: h = hash(v) and £ = height(u)} 

£<-e-i 

h <— h + hash(u^) 
push (h, £) on S 
{(h,£) now encodes vw'j} 
if I = then 

check that h = (if not, reject: "mismatched parentheses") 
pop and discard (h, £) 
end if 
end for 

push (hash(w"), \ w"\) on 5 
{(hash(u/'), \w"\) encodes w"} 
return the stack S 



We first start with the following observation about Algorithm [T] 

Define an order between words by taking the transitive closure of uv -< ull'v, where I, V G {a, b}, i.e. 
w -< x if w is a subsequence of x obtained by removing some (well-formed or not) matching pairs in x. 

Fact 4. Consider the stack right after a new push of an item encoding a subsequence ending with xj. Let 
vi,V2, ■ ■ ■ ,v m be the subsequences of letters of x encoded by the current stack items (bottom-up order). 
Then every V( has positive height, and ViVz . . . v m ^ x[l, j\. 

The above fact can be used to prove the following useful invariant of Algorithm [l] 

Fact 5. Let (h, £) be a stack item encoding some subsequence v of x. Then h = hash(v) and I = 
height (v) > 0, and v = v u Vd, where v u has only upsteps, has only downsteps. For all j E there 
is a unique i G v u such that (i, j) is a matching pair for x. Moreover £ = holds only for an item on top of 
the stack. 

Then, we conclude with the correctness of our algorithm. 

Theorem 1. Algorithm |7] is a one-pass randomized streaming algorithm for Dyck(2) with space 
0(i/nlogn) and time polylog(n). If the stream belongs to Dyck(2) then the algorithm accepts it with 
probability 1; otherwise it rejects it with probability at least 1 — n~ c . 

Proof. In terms of space requirements, each stack element takes space O(logn) and each execution of 
Algorithm [2] adds at most one element to the stack, for a total of at most ^fn stack items, hence space 
0(y/nlogn). The processing time is easy by inspection. 

To prove correctness, first assume that x G Dyck(2). By Fact [2] the height of prefixes are all non- 
negative, so the algorithm does not reject because of an extra closing parenthesis; and the height of x is 0, so 
the algorithm doesn't reject because of a missing closing parenthesis. For each block it;, the matching pairs 
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within w are all well-formed, so the algorithm doesn't reject them either. Finally, whenever the algorithm 
checks h = for a stack item such that £ = 0, by Fact [5] the corresponding encoded subsequence v is 
a matching set for x since x G Dyck(2), so v is balanced. Then by Fact|5} it passes the hash test in 
Algorithm |2| Therefore the algorithm is correct in this case. 

Second, assume that x Dyck(2). By Fact[2j x fails to be in Dyck(2) for one of the following reasons. 
Either some prefix of x has negative height (too many closing parentheses): then the algorithm detects the 
problem when it tries to pop an item from an empty stack, hence is correct. Or, the final height of x is 
non-zero: then the algorithm detects the problem at the very end when it sees that the stack is not empty, 
hence it is correct. Or, there is a matching pair (i, j) where x is not well-formed: that is the only non trivial 
case. If i, j are within the same block, then the algorithm rejects during the internal checks within the block. 
Assume now that i and j are in different blocks, and that the algorithm accepts x. Since the stack is empty 
at the end of the algorithm, at some point the stack item whose subsequence contains xj gets discarded. Let 
v be the subsequence encoded by the stack item at that point. By Fact [5] v also contains X{ and, since it is 
unbalanced at that height, from Fact [3] the probability that v passes the hash test in Algorithm [2] is at most 
n~ c , for a random uniform choice of a, so the algorithm is correct with probability 1 — n~ c . □ 

3.2 The two-pass algorithm 

The second algorithm depends on a parameter k = [log n] , where n is the size of the input word w. We 
assume that without lost of generality, n = 2 k . We achieve this by padding: we append to w the word 
(aa) 1 of suitable length (assuming that w is of even size, otherwise w Dyck(2)). This requires that we 
to store, after the first pass, the number of letters 2i we added. This uses only 0(log n) bits of memory This 
assumption is crucial for the analysis, since the algorithm uses a hierarchical decomposition of the stream 
into nested blocks and the assumption guarantees the same decomposition, whether we read it from left to 
right or from right to left. 

An important implicit convention we make, is that during the right to left pass, letters a, b are resp. 
interpreted as a, b (and vice-versa). 

As before, we use a stack data structure. Each stack item contains values h that have been obtained by 
summing hash(xj) for some j's in a subsequence v, along with auxiliary information I = height (u). In 
addition, we append to each stack item the index of the first letter in v, denoted first (u). 

Algorithm 3 Bi-directional algorithm 
S <— empty stack 

Algorithm [4] with parameters (k, S), reading the stream from left to right { k = [logn] } 

if S is not empty, reject: "missing closing parenthesis" 

Algorithm [4] with parameters (k, S), reading the stream from right to left 

{In the right to left pass, letters a, b are resp. interpreted as a, b (and vice-versa)} 

return accept 



Algorithm [4] recursively decomposes the stream into blocks (Figure [2] in Appendix [A]) . An i-block is a 
substring of the stream of the form x[(q — l)2 l + 1, q2' L ] for 1 < q < n/2 l . The main difference between 
Algorithm S) and Algorithm ^S) is that whenever the algorithm reaches the end of a block, it com- 
presses without checking the stack items encoding subwords from within the block. This compression is 
what reduces the stack size from yfn down to O(logn), but we pay for that in terms of accuracy. Since 
hash is commutative we lose information. For example compressing hash(aa6) with h.ash.(bbba) gives 
h.ash.(aabbbba) = h.ash.(aaabbbb): one word is ill-formed, the other one is well- formed, but after compress- 
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ing we can no longer distinguish between them. The crux of the analysis is that such critical information 
loss cannot occur both when reading the stream from left to right and when reading it from right to left 
(Fact[7]below, and Figure [3]in Appendix |A|) . 

Algorithm 4 Block algorithm (i, stack S) { reads 2* letters in block Bi, increases stack size by at most 1 } 
for j *— 1 to 2 do 
if i > 1 then 

{read recursively two (i — 1) -blocks B and B'} 
Algorithm gfi - 1, S) 
else 

read one letter y 

if y is an upstep then 

push(hash(y), 1, first (y)) on S 
{(hash(y), 1, first (y)) encodes y} 
else 

pop (h, £, f) from 5 (if empty, reject: "negative height") 

{(h,£, f) encodes some subsequence v: h = hash(v), £ = height (v), / = first (v)} 

£*-£-l and h^h + hash(y) 

push (h, £, /) on S 

{(h, £, f) now encodes vy} 

if £ = then 

check that h = (if not, reject: "mismatched parentheses") 
pop and discard (h, £, f) 
end if 
end if 
end if 
end for 

if S has, at the top, two items whose first letters are in Bi then 

{there are at most two such items ui,«2> that moreover are contiguous at the top} 
pop (h 2 ,£ 2 , f2) from S and then pop (hx,h,h) {/i, f 2 are in Bi} 

{(hi,£i, /i) encodes v\, (h 2 , £ 2 , f 2 ) encodes v 2 , and each index in v\ is smaller than all the ones in 

compress: push (h\ + h 2 , l\ + £ 2 , fi) on S 
end if 



First, observe that Fact |4] remains valid for Algorithm |3j Using this fact, we derive the following two 
invariants of Algorithm [3] that are similar to Fact [5] but more involved. 

Fact 6. Let (h, £, f) be a stack item encoding some subsequence v ofx. Then h = hash(-u), £ = height (w) > 
and f = first(t'). For every downstep j E v there is an upstep i S v such that (i, j) is a matching pair for 
x. Moreover £ = only holds for an item at the top of the stack. 

Fact 7. For every j G [n], after reading Xj and completing its processing, each stack item (h, £, f) with f < 
j satisfies height(x[l, /]) < height (x[l, j]) if Xj is an upstep, and height /]) < height (a: [1, j — 1]) if 
downstep. 

We now state a simple observation from the definition of matching pairs. 



9 



Fact 8. Let u,u',v be subsequences of x such that v = uu'. Then, for every possible height d, there is at 
most one matching pair in u x v! at height d. 

We conclude with the correctness of our algorithm. 

Theorem 2. Algorithm |3] is a bidirectional two-pass randomized streaming algorithm for Dyck(2) with 
space 0((log n) 2 ) and time polylog(n). If the input belongs to Dyck(2) then the algorithm accepts it with 
probability 1; otherwise it rejects it with probability at least 1 — n~ c . 

Proof. In terms of space requirements, each stack element takes space O(logn) and the stack has size at 
most 2k = 21ogn, hence space 0((logn) 2 ). The processing time is easy by inspection, while noticing by 
induction that each execution of Algorithm [4] generates only one new stack item. 

To analyze the algorithm, observe (using Fact[6]) that it is correct whenever x G Dyck(2). 

Now, assume that x G" Dyck(2). By Fact|2j either some prefix of x has negative height: then as before 
the algorithm is correct. Or, the final height of x is non-zero: then as before the algorithm is correct. Or, 
there is a matching pair at some height d where x is not well-formed. Consider that case. Let i be 

such that Xj and Xj> are in different (i — l)-blocks B and B' but in the same i-block B{. Assume further 
that among badly formed pairs, (j, f) has been chosen so that i is minimum (see Figure|5]in Appendix |A"|). 
Let m be the minimum of height (x[l, I]), where I ranges over B such that xi is an upstep. Similarly, let m! 
be the minimum of height (x[l, I — 1]), where I ranges over B but now such that x\ is a downstep. 

Without loss of generality (up to reversing left-to-right and right-to-left directions) assume that m > m' . 
Indeed if m < m' , let x denote the reverse string x, where letters a, b are resp. interpreted as a, b (and 
vice-versa). Then x = x[l,l]x[l,n — I] for every I, and blocks B,B' resp. become (re + 1 — B) = 
{n + 1 — I : I G B} and (n + 1 — B'). Moreover, since upsteps become downsteps, and conversly, 
we get that height {x [1, 1]) = height (x [1, n]) + height(x[l,n — I]). Thefore, m is also the minimum of 
height I — 1]), where I ranges over (n + 1 — B) such that x\ is an downstep. Similarly, m' is the 
minimum of height(x[l, I]), where I ranges over (n + 1 — B') such that xi is an upstep. 

After reading B, since j is not yet matched, the stack necessarily contains an item corresponding to a 
word containing j; moreover, since all compressions in B involve items with first letter in B, the first letter 
of that word is in B. From Fact|7j by the end of reading B' that item has been discarded. Let (h, £, /) be 
that item with its corresponding encoded subsequence v. Since the first letter / of v is in B, all of the letters 
of v are in B U B'. By Fact|6j v is a matching set, and, by Fact [8] its matching pairs in B x B' are all at 
different heights. So at height d, v only contains which is not well-formed, plus possibly some pairs 

coming from B x B or from B' x B', pairs that are all well-formed by minimality of i. Overall at height d 
the word v is unbalanced, so by Fact [6j the probability that v passes the hash test of Algorithm |4]is at most 
n~ c , for a uniformly random choice of a. So the algorithm is correct with probability 1 — n~ c . □ 

4 Lower bounds 

We define a family of hard instances for Dyck(2) as follows. For any word Z £ {a, b} n , let Z be the 
matching word associated with Z. For given m, n, consider the following instances of length Q(mn): 

w = X\f{c\c\f\ X2Y2C2C2Y2 . . . X m Y m 

where for every i, Xi G {0, l} n , Y\ = Xi[n — k{ + 2, n] for some ki G {1, 2, . . . , n}, and a G {a, b}. The 
word w is in Dyck(2) if and only if, for every i, q = Xi[n — ki + 1]. 

Intuitively, for m = re/ log n recognizing w is difficult with space o(re) because, after reading Xi, 
the streaming algorithm does not have enough space to store information about the bit at unknown index 
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(n — ki + 1), so when it reads q it is unable to decide whether a = Xi[n — ki + 1]; and after reading 
y m it does not have enough space to store information about all indices ki, k 2 , ■ ■ ■ , k m , so when it reads 
X m . . . X 2 X\ it misses out on its second chance to check whether q = Xi[n — ki + 1] for every i. The 
proof contains some subtleties and is executed in the language of communication complexity. 

We define a communication problem AsCENSlON(m) (Figure [5] in Appendix [b} associated with the 
hard instances described above. For convenience, we replace suffixes by prefixes. Formally, in the problem 
AsCENSlON(m) there are 2m players Ai,A 2 , A m and Bi,B 2 , B m . Player A, has Xj G {0, l} n , 
Bi has ki G [n] , a bit Cj and the prefix Xi [1, A; — 1] of Xj. Let X = (X\ , X2 , . . . , X m ), k = (k\,k 2 , . . . ,k m ) 
and c = (ci, c 2 , . . . , c m ). The goal is to compute / m (X, k, c) = \l™ =1 /(X;, h, a) = V™i(^#i] © c 0> 
which is if » [fej] = Q for all i, and 1 otherwise. 

Motivated by the streaming model, we require each message to have length at most size bits, where the 
parameter size is a function of m and n and corresponds to the allowed space in the corresponding streaming 
algorithm. We also require the communication between the 2m participants in a one-pass protocol to be in 
the following order: 

Round 1 

For i from 1 to m — 1, player Ai sends message M^. to Bi, then Bi sends message Mb 1 to Ai + \, then 
A m sends message M^ m to B m ; then 
Round 2 

B m sends message M# m to then 

For i from m down to 2, Aj sends a message M' A . to Aj_i; then 
A\ computes the output. 

To establish the hardness of solving AsCENSlON(m), we prove a direct sum result that captures its 
relationship to solving m instances of a "primitive" problem MOUNTAIN defined as follows. In the problem 
Mountain (Figure|4]in Appendix[B|), Alice has an n-bit string X G {0, 1}™, and Bob has an integer k G [n], 
a bit c and the prefix X [1 , k — 1] ofX. The goal is to compute the Boolean function f(X, k, c) = (X[k](Bc) 
which is if X[k] = c, and 1 otherwise. In a one-pass protocol for MOUNTAIN, the communication occurs 
in the following order: Alice sends a message M4 to Bob, Bob sends a message Mb to Alice, then Alice 
outputs f(X, k,c). 

As mentioned in Section [T] we follow the "information cost" approach, a method that has been partic- 
ularly successful in recent works on direct sum results. The method comes in a variety of flavours, each 
crafted to suit the application at hand. We describe the approach as adapted for AsCENSlON(m). Infor- 
mation cost is often defined in terms of the entire input and the full transcript of the protocol. We enforce 
both the nature of streaming algorithms and of our problem, by restricting our attention to only one mes- 
sage Ms m from the transcript. We also split the input in two parts, and measure the information in the 
message Mg m about one part (k, c), conditioned on the other part X. In our case, the conditioning corre- 
sponds to information that is in the hands of the subsequent players. The closest such measures, of which 
we are aware, were considered in |[T7l l5t. 

The direct sum result is proven using the superadditivity of mutual information for inputs (ki, q) picked 
independently from a carefully chosen distribution. In the defining information cost, we measure mutual 
information with respect to a distribution on which the MOUNTAIN function is the constant 0, eventhough we 
consider protocols for the problem that are correct with high probability in the worst case (or, equivalently, 
when the inputs are chosen from a "hard" distribution). The use of this easy distribution collapses the 
function AsCENSlON(m) to an instance of MOUNTAIN in any chosen coordinate. We massage this (already 
established) technique into a form that is better suited to the streaming model and to proving lower bounds 
for the primitive function MOUNTAIN. 
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We finish by giving a combinatorial argument that protocols computing MOUNTAIN in the worst case 
necessarily reveal "a lot" of information even when its inputs are chosen according to the easy distribution. 
Privacy loss, a measure similar to information cost, has been studied previously in protocols for Index 
(see, e.g., |[T6l Q31 and the references therein). Although this communication problem is closely related 
to Mountain, prior works study Index under hard distributions, and do not seem to extend directly to our 
case. 

4.1 Information cost 

We measure the information cost of a one-pass public-coin randomized protocol P for AsCENSlON(m) 
(of the form described in the previous section), with respect to some distribution v on the inputs (X, k, c), 
by 1C V (P) = I(k, c : Mb to |X, R), where R denotes the public-coins of P. From this we define the 
information cost of the problem AsCENSlON(m) itself with respect to a distribution v and error parameter 5 
as follows: ICP ub (AsCENSlON(m), 5) = min (lC y (P)), where the minimum is over one-pass public-coin 
randomized protocols P for the problem, with worst-case error at most 5. Note that the information cost 
implicitly depends on the length size of each message. 

For the problem MOUNTAIN we play a subtle game between public and private coins. We consider 
protocols in which Alice has access only to public coins R, whereas Bob additionally has access to some 
independent private coins Rb- We define IC V (P) = l(k, c : Mb\X, R), where R denotes only the public- 
coins of P. Further, we define IC™ 1x (Mountain, S) = min (IC U (P)), where P ranges over "mixed" 
public and private coin randomized protocols with worst case error at most 5 where Alice and Bob share 
public coins, and only Bob has access to extra private coins. 

We also make use of a related measure of complexity for MOUNTAIN when P ranges over 
protocols where Alice's message is deterministic, and Bob has access to private coins Rb- 
D I C™ 1 * (MOUNTAIN, fj,, 5) = min (IC U (P)), i.e., the minimum information cost with respect to v, where 
P ranges over protocols for MOUNTAIN, in which Alice's message Ma is deterministic given her input X, 
while Bob may use his private coins Rb to generate his message. Further, the distributional error of P is 
at most 5 when the inputs are chosen according to \x. Note that in general, and certainly in our application, 
v and n may be different, meaning that we measure the information cost of the protocol with respect to 
some distribution v, while we measure its error under a potentially different distribution \x. For later use, we 
recall that the distributional error under \i is ^xp/x,k,c)~fi ( P r (-P fails on (X, k, c))), where the probability 
is over the private coins Rb of Bob. 

We begin by relating the information cost for protocols in which Alice is deterministic to that of mixed 
randomized protocols. 

Lemma 1. DIC™ ix (MouNTAiN, /j,, 25) < 2 x IC^MOUNTAIN, £). 

Proof. Consider a randomized protocol P for MOUNTAIN with worst-case error at most 5 such that 
IC™ 1X (Mountain, 5) = IC U (P). We further assume that Alice and Bob have public coins R, and only 
Bob has extra private coins Rb- Then 



IC™ lx (MOUNTAIN,(5) 



E W (l(k,c:M Bm \X,R = r)), 



r 



Since P has worst-case error at most 5, it has distributional error at most 5 under fj,: 




< 5. 
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Therefore, by the Markov inequality, there is a set 1Z with Pr(r G 1Z) > \ such that 

VrGft Exp (Pr(Pfailson(X,A; ) c)| J R = r)) < 28. 

Now consider the information cost of P under the distribution v over inputs. We have 

Exp (l(fe,c: M B JX,R = r)) < 2 x IC™ ix (MouNTAlN, 5), 

ren 

since 7£ has probability mass at least 1/2. Therefore, there exists an r G TZ such that I(fe, c : Ms m \X, R = 
r) < 2 x IC™ 1X (M0UNTAIN, 5). Let P r be the protocol obtained by fixing the public coins used in P to r. 
Then Alice's message Ma is deterministic. By definition of 1Z, the protocol P r has distributional error at 
most 28 under \i, and \Q V [P) < 2 x IC™ ix (MouNTAlN, 5). □ 

4.2 Information cost of Mountain 

As explained beore, and formally proved in the next section, the information cost approach entails showing 
that the MOUNTAIN problem is "hard" even when we restrict our attention to an easy distribution. We prove 
such a result here. 

Let /i be the distribution over inputs (X, fe, c) in which X is a uniformly random ra-bit string, A; is a 
uniformly random integer in [n] and c a uniformly random bit. This is a hard distribution for MOUNTAIN 
(as is implicit in GUI O). We consider information cost of MOUNTAIN under the distribution fiQ obtained 
by conditioning fi on the event that the function value is 0: fio(X, fe, c) = ^(X, k, c\X[k] = c). 

Lemma 2. //size < n/100, then DIC™ X (M0UNTAIN, fj,, l/16n 2 ) = fi(logn). 

Proof. Let P be a randomized protocol for MOUNTAIN, where Alice's message Ma is deterministic, with 
distributional error at most l/16n 2 under the distribution u, such that \Ma\ < n/100. We prove that 
IC A10 (-P) = f2(logn). In the following, all expressions involving mutual information and entropy are with 
respect to the distribution ^o- 

By Markov inequality, there are at least 2™ _1 strings U on which P fails with error at most l/8n 2 on 
average on input (U, k, c), where (k, c) varies uniformly. Let S C {0, l} n of size at least 2 n ~ 1 be the set of 
such strings U. Then P has error probability less than l/4n on input (U,k,c), for every pair (k, c). 

Let a be a possible message Ma from Alice to Bob when her inputs range in S, and let S a = {U E 
S : Ma(U) = a}. For every string V G S a , we bound from below the mutual information of k and Mb, 
the randomized message that Bob sends back to Alice, as k varies. For this we construct a set I C [n] such 
that the message distributions = Mb(o, V[l, k — 1], k, V[k]) for k G / are pairwise well-separated 
in l\ distance. This is in turn established by exhibiting, for each fe € i, a string G S a such that 
V[l, k - 1] = V k [l, k-1] and V[k] ^ ^[fc]. The details follow. 

Associate with S a its dictionary T, a 2-rank tree (a tree with either 1 or 2 children at any internal node), 
all whose nodes except the root are labeled by bits; the root has an empty label. Each string V in S a is in 
one-to-one correspondence with a top-down path it in T from the root to one of its leaves, where the label 
of the (i + l)th node in tt is V[i). We identify V G S a with the path 7r in T, and refer to this path as V. 

The tree T has \S a \ leaves, each at depth n. For a fixed V G S a , let / be the set of integers k such that 
the (k + l)th node in path V has out-degree 2. By construction, for every k G / there exists another string, 
say, V k G S a such that V[l, k - 1] = F fc [l, fe — 1] and V[Jfe] / Vfe[fe]. 

Set Cfc = V[k] for every fe G [n]. Then the message distributions satisfy Mg(a, V[l, fe — 1], fe, c^) = 
Mb(o>, Vfc[l, fe — 1], fe, Cfc), for all fc G I. Let = Mg(a, V[l, fe — 1], fe, c/c). Let fe, fe' G I be distinct 
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indices such that A; < k'. As Vk'[l, k'— 1] = V[l, k'— 1], the message distribution Mgfo, V^[l, k— 1], fc, c^) 
on input (T4',/c,Cfc) equals mjt, and also Mg(a, V^/ [1, k' — l],k',cy) on input (Vy,k',cy) equals my. 
However, Vy [k] =V[k] = Ck, so the function evaluates to on input (Vy, k, Ck), and Vy[ k'] ^ V[k'] = cy, 
so the function value is 1 on (Vy, k' , cy). The protocol P computes its outputs from nik, Vy and my, Vy, 
respectively, on these instances, and errs with probability at most I /An. 

We use the above property of the distributions to lower bound the mutual information of k in the mes- 
sage Mb, given V. 

(41 1\ — ra \ 
-4 log n - 2. 
An J 

(We prove this below.) 

Next, we observe from the properties of 2-rank trees that the number of strings V G S a for which |/| = I 
is at most 2 l . The number of V for which |/| < Z — 2 is therefore at most 2 l ~ 1 . Now fix I = log \S a \, and 
note that the proportion of V 6 S a with |/| > I — 1 is at least 1 /2. Therefore ~Exp VeSa \I\ > -jp. 

We now concentrate on the messages a such that Pry uniform (Ma.(X) = a\X G S) > 2 _n / 10 . Then 
I = log \S a \ > n— 1— n/10 = 0.9n— 1, and for n large enough, Exp V€Sa H(fc|Ms, X = V) < 2+^j logra, 
and therefore Exp VgSa (I(A;, c : M B |X = V)) > ^ logra -2. 

The net probability of messages a which have probability at most 2 _n//10 given X G S is at most 
2 n/ioo 2 -n/io = 2 -9n/io > which is neg iigibl e . Therefore we conclude that l(k, c : M B \X) = 0(log ra). □ 

Proo/o/'Facf[2] Fix a string V, and the corresponding set of indices /. Suppose we are given as input a 
distribution rai = m&, for some k G I. We recover A; using the following procedure IT: 

1. For each k' G /, simulate the Alice's computation of the output in the protocol P, by setting Mb = m, 
the input distribution, and X = Vy. 

2. Let (dy ) y el De the sequence of outputs Alice generates from the above simulation. Output the largest 
k' for which dy = 1. This is our guess for k. 

On input my the simulation of P above generates dk = 1, and d^/ = for k' > k, with probability at least 
1 — l/4n for any fixed k' > k. Therefore, the procedure outputs k' = k with probablity at least 3/4. 

We now argue that the entropy of k is significantly reduced when given Mg, X, under the distribution uq 
(i.e., when Ck = X[k}). This is equivalent to saying that the mutual information of k and Mb is high. When 
the inputs are picked according to the distribution uq, we have 

I(k,c : M B \X = V) = R{k\X = V) -R(k\M B ,X = V) = logn - R(k\M B ,X = V). 

We bound from above the conditional entropy H(fc|Me, X = V). We first separate the values of k G" / as 
follows. Let p = \I\/n, and define the Boolean random variable J as 1 iff k G I. We have 

R{k\M B ,X = V) = R{kJ\M B ,X = V) 

= H(J\M B ,X = V) + tt(k\M B ,X = V,J) 

= H(p) + (1 - p)H(k\M B ,X = V,k#I) + P H(k\M B ,X = V,k€l) 

< l + (l-p)logn + R(k\M B ,X = V,ke I) 

< l+(l-p)logn + R(k\K,X = V,keI), 

where K is the random variable computed by our finding procedure II, and the final step follows from 
the Data Processing Inequality. For any fixed k G /, given Mb the procedure IT computes K = k with 
probability at least 3/4. By the Fano Inequality, we have 

H(k\K,X = V,k G /) < H(i) + ^log(|/|-l) < 1 + ^logn. 
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□ 



By combining Lemmas [T] and [2] we get 



Theorem 3. IC™ ix (MouNTAiN, l/42n 2 ) = fi(logn). 

4.3 Reduction to from AsCENSiON(m) to MOUNTAIN 

We now study the information cost of AsCENSlON(m) for the distribution /x™ over ({0, l} n x [n] x {0, l}) m 
for X = (Xi, Xi, . . . , X m ), k = (k\, k2, ■ ■ ■ , k m ) and c = (c%, C2, . . . , c n ). We state a direct sum property 
that relates this cost to that of one instance of MOUNTAIN, and then conclude. 

Lemma 3. ICjm (AsasNSiON(m), 8) >mx IC™ ix (Mountain, 5). 

Proof. Let P be a public-coin randomized protocol for Ascension(to) with worst-case error 5 such that 
IC^m(P) = ICP™ b (AsCENSION(m),5). 

From P, we construct the following protocol P- for MOUNTAIN, where j G [n]. Let (X, k, c) be the 
input for MOUNTAIN. 

1. Alice sets Aj's input Xj to its input X. 

2. Bob sets Bj's input (kj,Xj[l, kj — 1], Cj) to its input (k, X[l, k — 1], c). 

3. Alice and Bob generate, using public coins, (Xj, ki, Cj) according to independently for all % < j, 
and Xi uniformly independently for i > j. 

4. Bob generates (ki) uniformly independently for i > j, but using his private coins. Then Bob sets 
Q = Xj[/cj] for i > j (so that (Xi, ki,Ci) are now according to fiQ, independently for all i < j). 

5. Alice and Bob run the protocol P by simulating the players (A4, Bi) 7 ^] =1 as follows: 

(a) Alice runs P until she generates the message M^ . from player Aj. She sends this message to 



(b) Bob continues running P until he generates the message Ms m from player B m . He sends this 
message to Alice. 

(c) Alice completes the rest of the protocol P until the end, and produces as output for P'-, the output 
of player A\ in P. 

By definition of the distribution /xq> we nave f(Xi, ki, Cj) = for all i ^ j. So / m (X, k, c) = f(X, k, c), 
and each protocol P'- computes the function /, i.e., solves MOUNTAIN, with worst-case error 5. 

We prove that IC^^P) = ^ . IC Mo (Pj), which implies the result, since only Bob uses private coins in 



Let R denote the public coins used in the protocol P. By applying the chain rule to IC^(P), we get 



Bob. 




I(k,c:M B JX,R) 





which is the same as l(kj, Cj : Ms m |X, k\,c\,..., kj-\,Cj-\, R), the direct sum result follows. 



□ 



15 



Theorem 4. Let P be a public-coin randomized protocol for AsCENSlON(n/ log n) with worst-case error 
probability l/42n 2 , then size = f2(n). 

Proof. Let m = n/logn and S = l/42n 2 , and let P be a public-coin randomized protocol for 
AsCENSlON(m) with worst-case error probability 5. Obviously, IC^(P) is at most size. On the other 

hand, by definition IC^m b (AsCENSlON(m), 5) is less than or equal to IC^(P). By Lemma |ij we have 
ICjJ b (ASCENSION(m), 5) > m x IC™ ix (MOUNTAIN, 5). By Theorem^ we get IC™ ix (M0UNTAlN, 5) = 
O(logn). Combining yields size = fi(mlogn) = O(n). □ 

Corollary 1. Every one-pass randomized streaming algorithm for the matching parenthesis problem of 
words of length n! with ( two-sided) error 0(l/n' log n') uses 0(i/r? log n') space. 

Proof. Assume we have a one-pass randomized streaming algorithm for the matching parenthesis problem 
of words of length n' with (two-sided) error 0(1 /n' log n 1 ) uses Q(yjn' log n') space. Then, by the discus- 
sion at the beginning of Section |4j there is a public-coin randomized protocol for AsCENSlON(n/logn) 
with n = Q(^n' log n') and with worst-case error probability l/42n 2 . By Theorem [4j the mes- 
sages must have size 0(n), and therefore, by the discussion, the streaming algorithm must have space 
0(n) = tty/n> 'log ri). □ 
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A Example of execution of Algorithms 

Figure [T] shows an example of execution of our one-pass algorithm. Here there are eight blocks, and they 
are shown after the internal simplifications have already been done. The dotted vertical lines mark times at 
which the stack changes size, either starting a new stack item (for example, at time to) or discarding a stack 
item (for example, at time t^). Note that blocks and stack items are staggered: the first item incorporates the 
first block and the downsteps of the second block, the second item incorporates the upsteps of the second 
block and the downsteps of the third block, etc. The bullets mark times when the algorithm checks and 
discards an item. The horizontal lines go from the time when a stack item is created to the time when it is 
checked and discarded. For example, at time tj the algorithm checks and discards an item (h m , £ m ), (d, Id) 
such that h m incorporated the upsteps marked in bold on the figure, namely x(t\, t^], and d incorporated the 
downsteps marked in bold on the figure, namely xfo, £3], x(t&, £5] and x(t§, t?]. 



and 




Figure 1: Example of execution of Algorithm [T| 

Figure [2] illustrates the logarithmic block decomposition of the input word into all the blocks that will 
be activated during one -pass. They are identical from the left-to-right pass and the right-to-left pass since 
thanks to padding the input length is a power of 2. At every instant, only one f-block is activated for each i. 

Figure [3] gives an intuition of the proof of Fact [7] The bold-face lines represent matching pairs between 
the two {i — 1) -blocks B, B' within the same i -block Bi. In the case of the figure, those pairs are checked 
during the left-to-right pass, since the minimum height m within the left {% — l)-block B is larger than 
the minimum height m' with the right (i — 1) -block B' (during the right-to-left pass, they are compressed 
without any checks when Bi is processed). 



B Figures for Mountain and AscENSiON(m) 

Figure [4] presents an input stream with its division between players Alice and Bob. The horizontal axis 
represents the length of the stream seen so far, and the vertical axis represents the corresponding height. We 
introduce a potential mismatch denoted by letter c in Bob's input. 
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Figure 2: Decomposition in block-structure 
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Figure 3: Illustration of Fact |7j for an example of execution of Algorithm |3j 

Figure [5] presents the m-fold nesting of the above stream. The stream is now divided between 2m 
players. There are m potential mismatches each caused by the letter Cj in B^s input. 
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Bob 




Figure 4: Problem MOUNTAIN: Y[l, k - 1] = X[l, k - 1]. The word is well-formed if and only if c 




